Massive Data Streams Research: Where to Go
نویسنده
چکیده
This phenomenon has challenged how we store, communicate and compute with data. Theories developed over past 50 years have relied on full capture, storage and communication of data. Instead, what we need for managing modern massive data streams are new methods built around working with less. The past 10 years have seen new theories emerge in computing (data stream algorithms), communication (compressed sensing), databases (data stream management systems) and other areas to address the challenges of massive data streams. Still, lot remains open and new applications of massive data streams have emerged recently. We presents an overview of these challenges.
منابع مشابه
Scaling Up for High Dimensional Data in Data Stores and Streams
The data in engineering and science has been on a massive scale and stored in gigantic storage devices. The data is moved in and out in the form of data streams. Data storage levels are reaching Yottabytes in terms of storage. Science and engineering transforms such data into rich and resourceful data. Intensive methods have been researched for high dimensionality. Science also uses high speed ...
متن کاملApproximate Geometric Query Tracking over Distributed Streams
Effective Big Data analytics pose several difficult challenges for modern data management architectures. One key such challenge arises from the naturally streaming nature of big data, which mandates efficient algorithms for querying and analyzing massive, continuous data streams (that is, data that is seen only once and in a fixed order) with limited memory and CPU-time resources. Such streams ...
متن کاملQuerying Distributed Data Streams - (Invited Keynote Talk)
Effective Big Data analytics pose several difficult challenges for modern data management architectures. One key such challenge arises from the naturally streaming nature of big data, which mandates efficient algorithms for querying and analyzing massive, continuous data streams (that is, data that is seen only once and in a fixed order) with limited memory and CPU-time resources. Such streams ...
متن کاملWhere Are the RDF Streams?: On Deploying RDF Streams on the Web of Data with TripleWave
RDF Stream Processing (RSP) bridges the gap between semantic technologies and data stream systems. Although a number of RSP systems have been recently proposed, no RDF streams are actually made publicly available on the Web. To cope with this, RSP engines require ad-hoc wrappers in order to be fed from non-RDF streams available on the Internet. In this paper we present TripleWave: an approach f...
متن کاملChapter 9 MINING TEXT STREAMS
The large amount of text data which are continuously produced over time in a variety of large scale applications such as social networks results in massive streams of data. Typically massive text streams are created by very large scale interactions of individuals, or by structured creations of particular kinds of content by dedicated organizations. An example in the latter category would be the...
متن کامل